A GRAPH COUNTEREXAMPLE TO DAVIES' CONJECTURE 



GADY KOZMA 



Abstract. There exists a graph with two vertices x and y such that the ratio of 
the heat kernels p(x,x; t) / p{y,y; f) does not converge as t — > oo. 



This paper is concerned with a conjecture of Brian Davies from 1997 on the heat 
kernel of Riemannian manifolds [D97, §5]. We will not disprove the conjecture as 
stated, but rather transform it to the realm of graphs using a well-known (though 
informal) "dictionary" between these two categories, and build a graph that will 
serve as a counterexample. We will make some remarks on how the construction 
might be carried over back to the category of manifolds in the end, but we will not 
give all details. The bulk of this paper is about graphs. 

We start by describing the conjecture in its original setting. Let M be a con- 
nected Riemannian manifold, and let p be the heat kernel associated with the Laplace- 
Beltrami operator on M. Then Davies conjecture states that for any M and any 3 
points x, y, z 6 M the limit 

lim 4^ (1) 

M-oo p(z, z;f) 

exists and is positive. Here p(x,y; t) is the value of the heat kernel at time t and at 
points x and y. This property is known as the "strong ratio limit property" (where 
the "weak" version is an averaged result due to Doblin, [D38]) or SRLP for short. 
So Davies' conjecture is that in these settings SRLP always holds. SRLP holds for 
manifolds with one end [D82] and for strongly Liouville manifolds (i.e. manifolds 
where any positive harmonic function is constant), see [ABJ02, corollary 2.7] who 
also make interesting connections between these properties and the infinite Brow- 
nian loop. 

Ratio limit properties were considered for Markov chains even earlier. If M is 
any Markov chain on a countable state space, then we say that M satisfies SRLP if 
(1) holds for any three states x, y and z, where p{x, y; t) is the probability that the 
Markov chain started at x will be at y at time t. For general Markov chains there 
are a few examples where SRLP does not hold. Clearly it does not hold when the 
Markov chain has some kind of periodicity. F. J. Dyson constructed an example 
of an aperiodic recurrent Markov chain which does not satisfy SRLP [C60, part 
I, §10]. That example utilizes long chains of states with only one outgoing edge, 
which the walker must traverse sequentially. In particular it is not reversible. 

Now, the Laplace-Beltrami operator is self-adjoint so a proper analog of Davies 
conjecture needs to assume that the Markov chain is reversible. Reversible Markov 
chains are also known as random walks on weighted graphs. The issue of pe- 
riodicity can be dealt with by looking at random walk in continuous time or at 
lazy random walk. Lazy random walk is a walk where the walker, at every step, 
chooses with probability j to stay where it is, and with probability j moves to one 
of the neighbours (with probability proportional to the weights). 
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The result is this paper is 

Theorem. There exists a connected graph G with bounded weights and two vertices 
x,y 6 G such that the heat kernel of the lazy random walk satisfies 

V{x,x;t) ^ 

as t — > oo. 



Let us remark on the "bounded weights" clause. When doing analogies be- 
tween manifolds and graphs, it is often assumed that the manifold has bounded 
geometry and the graph has bounded weights. Davies, however, explicitly does 
not make the assumption of bounded geometry. Thus one might wonder what 
is exactly the graph analog. All this is mute, of course, since the counterexample 
does has bounded weights (and hence a manifold example constructed along the 
same line should have bounded geometry). 

It is easy to see that in these setting (reversible, irreducible, lazy) this ratio must 
be bounded between two constants independent of n. Hence if it does not con- 
verge then it must fluctuate between two values. The proof constructs the graph 
with two halves, denoted H e and H° (e and o standing for even and odd, H stand- 
ing for half), which are connected by one edge, {x, y). On the "odd scales", H e will 
"look like Z 22 " while H° will "look like 1?" . This means that to get from itox 
(where x is on the H e side), the most beneficial strategy is to move to y as fast as 
possible, spend most of your time on the H° side and return to the H e side only in 
the last minute. Clearly this would mean that p(x, x; t) is smaller than p(y, y; t) as 
the random walk starting from y can stay on its side at all time, not losing the con- 
stant that x needs for the maneuver. On the "even scales" the picture is reversed 
and y is at a disadvantage. See figure 1 — drawing in 22 dimensions might have 
distracted the reader, so the figure demonstrates the construction in 1 and 2 dimen- 
sions. The smallest two braces in the figure are the first scale, in which H° is really 
one dimensional and H e is really two-dimensional. The larger two braces indicate 
the second scale. This time H° is a network of lines so it should be thought about 
as two dimensional, while H e is a thick column, so it should be thought about as 
one dimensional. The third scale is only hinted at in this figure, but one can imag- 
ine that H° now becomes a thick band, so it is again one-dimensional, while H e 
becomes a network of these thick columns and bands, so it is back to being two 
dimensional. 

As one might expect, the numbers 3 and 22 have no particular significance. 
They both have to be > 2, since otherwise our graph would be recurrent and 
recurrent graphs always satisfy SRLP [061, theorem 3]. And of course they have 
to be different. We took here the large value 22 in order to be able to be wasteful in 
various points (sum over times and such stuff), but the proof could proceed with 
any value larger than 3. 

This paper was first written in 2006. I wish to take this opportunity to apologize 
to all those who has to wait so long for it to appear, with no real reason. My 
intentions were good but my time management was abysmal. I wish to thank 
Yehuda Pinchover for telling me about the problem and for reading early drafts. 
Partially supported by the Israel Science Foundation. 
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FIGURE 1. The graphs H° and H e 



1. Proof 

The construction uses Z rf -like graphs as building blocks, so we start with quot- 
ing a few results on these. We first recall the notion of rough isometry [K85]. 

Definition. Let X and Y be two metric spaces. We say that X and Y are roughly 
isometric if there is a constant C and a map q> : X — » Y with the following proper- 
ties: 

(i) For all x and y in X, 

1 

-d(x,y) - C < d{cp(x), cp(y)) < Cd(x,y) + C 

(ii) The image of (p is roughly dense, i.e. for all y E Y there is an x 6 X such 
that d(y, cp(x)) < C 

If G and H are graphs we say that they are roughly isometric if they are roughly 
isometric when considered with the metric d being the graph distance, namely 
d(x,y) is the length of the shortest path between x and y, or oo if no such path 
exists. 
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With this definition we can state the following standard result, essentially due 
to Delmotte. 

Lemma 1. Let Gbe a graph roughly isometric to Z d . Then the heat kernel p for the lazy 
walk on G satisfies, for all t > 1, 

cr dn < p(x,x;t) < cr d/1 . 

Here G is a simple graph — we do not allow weights or multiple edges. C and 
c are constant which do not depend onf. In general we will use c for constants 
which are small enough and C for constants which are large enough, and different 
appearance of c and C might relate to different constants. 

Proof. By Delmotte's theorem [D99] any G which satisfies volume-doubling and 
the Poincare inequality, satisfies 

1 

p(x,x;t) « — — 

where B{x,r) is the ball around x with radius r (again with the graph distance), 
and \ B(x, r) \ is the sum of the degrees of the vertices in B. The notation X ~ Y is 
short for cY < X < CY. The fact that G is roughly-isometric to Z d gives 

\B(x,r)\*r d (3) 

so we would get p(x,x;t) ~ t~ d ^ 2 , as needed. So we need only show that G 
satisfies volume doubling and Poincare inequality. 

Now, the definition of volume doubling is that for every x a vertex of G and 
every r > 1 

\B(x,2r)\ < C\B(x,r)\ 

and it follows immediately from (3). The Poincare inequality is not much more 
complicated. The definition is: for every vertex x, for every r and for every func- 
tion/ : B(x,2r) -> R, 

E de S (y)\f(y)-f\ 2 <Cr 2 £ (/(y)-/W) 2 (4) 

y£B(x,r) (y,z)GE(B(.r,2r)) 

where 

_ i 

f= lB(x r] \ E de §(y)/(y)- 

and where deg(y) is the degree of y, and E(B) is the set of edges both whose 
vertices are in B. Now, Z d satisfies the Poincare inequality (see e.g. [PSC, §4.1.1]). 
It is well-known and not difficult to see that the Poincare inequality is preserved 
by rough isometries (it uses the fact that D deg(y) |/(y) — a\ 2 is minimized when 
a = f). This finishes the proof. □ 

Lemma 2. Let Gbe a graph roughly -isometric to d > 3 and let x be some vertex. Let 
p be the probability that lazy random walk starting from x returns to xfor the first time at 
time t. Then p > ct^ d ^ 2 . 

Proof. Let p\ be the same probability but without the restriction that this is the first 
return to x. This is exactly p(x,x;t) and by theorem 1 we have p\ > ct~ d ^ 2 . Fix 
some K and examine the event that the random walk returns to x at t and also at 
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some time s £ [K, t — K] . Let pi be its probability. Using the other direction in 
theorem 1 we can write 

t-K t-K 

Pi < E p(x,x;s)p(x,x}t-s) < Cj^s- d/2 (t- S y d/2 < C¥}- dll r An . 

s=K s=K 

Since d > 3 we can choose K sufficiently large such that p2 < \p\- So we know 
that with probability p\ — P2 > ct~ d ^ 2 the walk does not return to x between K 
and t — K. If it does reach x before time K, do some local modification so that it 
does not. For example, if the original walker reached x at some time s < K and on 
the next step went to some neighbour y of x, modify it to walk to y in the first step 
and stay there for s steps (remember that our walk is lazy) and then continue like 
the original walker. Clearly this "costs" only a constant and ensures our walker 
does not visit x in the interval Do the same for the interval [t — K, t — 1], 

losing another constant. The finishes the lemma. □ 

Lemma 3. Let G bea graph and let p be the heat kernel for the lazy walk on G. Let t and 
s satisfy that \ t — s\ < \ft. Then 

\p(x,x;t) — p(x,x;s)\ < C— — s I * p( X/ x;t) + Ce~ clog2 ' . 

Proof. Denote by q{x,y; t) the heat kernel for the simple random walk on G. Then 
by definition 

i 



p{x,x;t) = ^q(x,x;i)[.)2 f . 

i=0 



Writing the same formula for p(x,x;s) and subtracting we get 

\p(x,x;t)-p(x,x;s) \ <J^q(x,x;i) ( (*J2~* — (')2~ s ) = Ei + E 2 



where Ex is me sum over all [f — 2/| < \[t log t and E2 is the rest. A simple calcu- 
lation with Stirling's formula shows that 



J J V nt r V 
And with some more calculations 



r-i / .n / 2 /, ,,-\2/, f |£ - s| loe t Jf-slloe t , . 

while 



\t-2i\<^t ' M St Vt 



E 2 <C £ e -c(t-20 2 /f < cg-clo^t 
|f— 2£|>\/f logf 

proving the lemma. □ 

Proof of the theorem. Abusing notations, for subsets HcZ" 1 we will not distinguish 
between H as a set and as an induced subgraph of Z d (d will be 22). For the con- 
struction we need a sufficiently fast increasing sequence flj < «2 < • • • . We further 
assume that aj; are all even and that fljt-i divides \a^. It would have probably been 
enough to choose % = 2 fl,r -i, but it turns out simpler to choose the a^ inductively, 
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and we perform this as follows. Let a\ = 2. Assume now a.\, . . . , a/ c _i have been 
defined. Define, for integers m < ll and i £ {1, . . . ,22}, 



Ql,m,i :=|#€l Z 22 : |n, mod l\ < mj 

Q/,m := U Pi QiW- 



Ic{l,...,22}tGl 

:|=i9 

Here n mod Z € { — |_^-J/ • ■ • ' L2J }• m words, Q/ /m/ ; is a 21-dimensional subspace 
of Z 22 orthogonal to one of the axes, fattened up by 2m + 1 (a "slab") and repeated 
periodically with period /. Q; „, is the collection of all 3-dimensional subspaces, 
fattened and repeated similarly. The particular point is in fact contained in all 
( 2 3 2 ) of these 3-dimensional slabs which will be a little inconvenient, so let us shift 
Qi,m by 

v(m) = y \m, . . . , \m, 0, . . . , 

3 times 19 times 

In the shifted set Qj m + v(m) the geometry of the neighbourhood of is simpler, 
it is contained in just one slab. Compare to the figure on page 3. The point x is in 
the middle of a fat column and not at the intersection of a column and a band. 

We want to use these graphs with / = cij and m a little larger than a,-_i . Precisely, 
define 

/ 

b j = E a k- 

k=l 

With this choice of bj, Q(aj,b;_\,i) contains only complete components of Q(fl/, 
z) for each I < j. Each such component is either contained in Q(a,-,by_i,i) 
or disjoint from it. The same holds for the translations Q(fl/, + v(a{) (we 

need here that fl; > 4«;_ 1 so let us assume this from now on). For brevity, define 

Vj = v(aj). 

We may now define two graphs, denoted by H? , and Hi* ("e" and "o" stand- 
ing for even and odd) by 

2<;'<fc-l 
/ even/odd 

We shall usually suppress the k — 1 from the notation. It is not difficult to check 
that H e /° are both roughly isometric to Z 22 (the rough isometry constant depends 
on the "past" a\, . . ., Therefore by lemma 1 we see that there exists an a 

(again, depending on the past) such that 

p He/0 (x,x;t) < oct~ n . (5) 

Examine now the graphs 

F k-° : = H k-°i n {* e z22 : \ n i\ ^ h-i vf = 4, ... ,22} . 

pe/o are b om roughly isometric to 1? so by lemma 2 there exists some /5 such that 
P f e/o (the walk returns to for the first time at f) > ^~ 3/2 - (6) 
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Define 7/ c := [max{a, /3}] (as usual, [■] stands for the upper integer value). With 
these we can define a k to be any even number satisfying a k > + ^a k _\ and 
such that a k _\ divides \a k . This completes the description of the induction, and 
we define 

H e J°-.= n (Q w -a + fy). 

j even/ odd 

These graphs will be the two halves of our target graph G. 

Before continuing, let us collect some simple facts about H%1°: 

(i) Htl ° is connected — in fact we used this indirectly when we claimed 
are roughly isometric to Z 22 . 

(ii) ° are transient — this follows because each contains a copy of Z 3 
(namely {«4 = ■ ■ ■ = n 22 — 0}) and transience is preserved on adding 
edges. This last fact follows from conductance arguments, see e.g. [DS84]. 

Define therefore the escape probabilities 

£ e/ °:=rL/o(R(f)^0Vf>0) 

(R being the random walk on the graph) and let J := i min{e e , £ }. Define the 
graph G by connecting H%, to H£, with a single edge between the two with 
weight 5. Define x := e and y = 0°. This is our construction and we need to 
show (2), which will follow if we show that, for k sufficiently large, 



p(x,x;t 2k ) >3p{y r y;t 2k ) 
p(x,x;t 2k+1 ) < \p{y,y;t 2k+1 ) t 



h ■= ll ( 7 ) 



We will only prove the even case, the odd will follow similarly. 
Examine therefore p(x, x; t 2k ). Since a 2k > t 2k we get that 

H e J° n [-t 2k , t 2k } 22 = H e 2 [° n H», t 2k } 22 

or in other words, the steps after 2k do not effect us at all. Similarly it is possible 
to simplify the last stage namely 

H e 2k n f 2 d 22 = Jfffc.! n (Q^,^ + n [-t 2fc , t 2fc ] 22 = 

= Htk-l n {" e Z 22 : |n;| < Vi = 4, ... ,22} = F^.j 

(here is where these translations by t>y are used). By (6), 

1 

Pq{x, x; t 2k ) > -Ph 1 " (R returns to x for the first time at t) > 

(6) 1 1 7M 

* 2^ = 2** ' (8) 

(the i comes from the first step). 

To estimate p(y,y; t 2k ) we divide the event {R(f2fc) = y} according to whether 
R "essentially goes through x" or not. Formally, denote by T\ and T 2 the first and 
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last time before t 2k when R(T) = x (if this does not happen, denote T\ = oo and 
T 2 = —00). Then we define 

pi := P(Ti > 72*, R(t») = y) P2 := P(T 2 < tat " 72*, R(t») = y) 
P3 := F(Ti < 72/c, T 2 > t 2 fc - 72*;/ K(*2Jfc) = y) 

so that p(y,y; f 2/c ) < pi + p 2 + P3- 

Now, pi and p 2 are easy to estimate. As above we have 

H 2k n [-hhhk] 22 = H%k-t n [-hk'hk] 22 
so (5) applies and we get 

F v HO jR(t) = y) < l2k r n Vt<t 2k . (9) 

Therefore 

Pi < E P ( T i = f ' R ( f 2fc) = y) + P(Ti = oo, R(t 2k ) = y)< 

t=72k 
tlk-2 

< E V H ojR(t)=y)+P H ojR(t 2k )=y)< 

t=7at~l 

(9) ta~ 2 m 

< E 72)c-f- 11 +72 fc -f 2/c 11 <C7 2 f = Q 2/c 9/4( = ) O (p(x,x;0) (10) 
f=72*-l 

and similarly for p 2 . As for P3, we have 

00 \ $ 1 

< 72fc) < ( E P Hg= ( r visits V 1 times before 72/c)) • 6 < ~s < ? 

1=0 j e l 

and similarly (using time reversal) for P(T 2 > t 2k — J 2k )- Hence we get 

1 

Vr, < - max p(x,x;s) 

and by lemma 3, 

P3<ip(x,x; f2/c) (l +0 (^^ 
(8) 1 

< -£P(x,x;t 2 k) (l + o(l)). 
With the estimate (10) for pi and the corresponding estimate for p 2 we get 

p{y,y;hk) < p(*/*;t») Q + • 

A completely symmetric argument shows that at t 2k+ i the opposite occurs: 

p(x,x;f 2 , c+ i) < p(y,y,t 2k+1 ) fl+ (l)J 

proving the theorem. □ 



+ Q( e -clog 2 f 2 ,) < 
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Remark. If you want an example with unweighted graphs, this is not a problem. 
H e and H° are already unweighted, so the only thing needed is to connect them, 
instead of with an edge of weight 5, with a segment sufficiently long such that the 
probability to traverse it is < 5. The proof remains essentially the same. 

2. Manifolds 

We would like to show a Manifold M and two points x, y G M such that the 
heat kernel on M satisfies 

p(x, x; t) 

as f — » oo. Here is how one might translate the construction of our theorem to the 
settings of manifolds. The dimension of the manifold plays little role, so we might 
as well construct a surface. 

For a subset H C Z 22 one can associate a manifold H* by replacing each vertex 
v G H with a sphere v* and every edge with a empty, baseless cylinder. Since the 
degree of every vertex in H is < 44 we may simply designate 44 disjoint circles on 
S 2 and attach the cylinders to the spheres at these circles. This is reminiscent of the 
well-known "infinite jungle gym" construction, see some lovely pictures in [ON]. 
The exact method of doing so is unimportant since anyway the manifold that we 
get is roughly isometric to H, considered as an induced subgraph of Z 22 (one of 
the nice features of rough isometry is that continuous and discrete objects may be 
roughly isometric, rough isometry inspects only the large scale geometry). Clearly 
H* can be made C°°. 

One can then construct a (possibly different) sequence and two manifolds 
* with the only difference is that the a and $ must satisfy (5) and (6) for our 
choice of the * operation. This should be possible since (H^ )* and (F^ )* are 
roughly isometric to Z 22 and Z 3 respectively Instead of Delmotte one can use the 
manifold version [SC95] (or rather, [D99] is the graph version of earlier results for 
manifolds, see [SC95] for historical remarks). 

The argument for the transience of (H^°)* should also be direct translation. 
Each contains a submanifold (with boundary) which is roughly isometric to Z 3 
and therefore is transient. Since transience is equivalent to the fact that for some 
c > every function which is 1 at x and at infinity satisfies that the Dirichlet form 
(V/, V/) > c, and since restricting to a submanifold only decreases the Dirichlet 
form, we see that (H e J )* are transient. Denote by 

e e/0 = inf P* (W[0, oo) n (O e/o )* = <Z>) 

where W here is the Brownian motion on the manifold (H^ )*; and where the 
infimum is taken over all x belonging to a sphere v* where v is some neighbor 
of O e// ° in H^/°. One can now define 5 = 1 min(£ e , e°) and connect fj e to 0° by a 
cylinder sufficiently thin (or sufficiently long) such that the probability to traverse 
it in either direction before reaching a neighboring sphere is < 5. This concludes a 
possible construction of a manifold M, and one may take x to be an arbitrary point 
in (0 e ) * and y and arbitrary point in (0°) . 

The proof that p(x, x; t ) / p(y,y, t) does not converge should not require signif- 
icant changes. We note that in our case it is possible for a Brownian motion at 
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time t to exit the box [— f, t] 22 , but it is exponentially difficult to do so. Hence, for 
example, instead of (8) we get 



for k sufficiently large. Another point to note is that lemma 3 needs to be replaced 
with an appropriate analog. 
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